Claims 

1. An intrinsic representation extraction rule generating system characterized by the 
following facts: the intrinsic representation extraction rule generating system performs computer 
processing to generate the rule for use in extracting the intrinsic representations from a document 
on the basis of the document for training data in a storage device beforehand and a correct 
answer list that lists what is contained as the intrinsic representations (correct answer intrinsic 
representations) at what positions in the document for training for extracting what type of 
intrinsic representations; and it has the following means: a word type/character type attaching 
means, which reads said document for training from said storage device and divides it into words, 
attaches word type and structural character type to each word, generates a word row information 
that forms the intrinsic representations contained in said document for training and stores it in 
said storage device; a rule generating means, which reads the various correct answer intrinsic 
representations of said correct answer list from said storage device, compares them with the 
various word row information generated with said word type/character type attaching means, and 
generates the rule for extracting said correct answer intrinsic representations; a means for 
application of the rule for training, which reads said document for training and said rules from 
said storage device, applies said rules in said document for training, extracts the corresponding 
intrinsic representations (candidate intrinsic representations) and records them in said storage 
device; a rule evaluating means, which reads said candidate intrinsic representations and the 
correct answer intrinsic representations of said correct answer list from said storage device, 
compares them with each other, and computes the appropriateness of each rule used in extracting 
each candidate intrinsic representations on the basis of a prescribed computing sequence; a rule 
deleting means that deletes the rule with an appropriateness computed using said rule evaluating 
means lower than a prescribed appropriateness from said storage device; and a rule refining 
means that corrects the rule having the appropriateness computed using said rule evaluating 
means within a prescribed appropriateness range so as to increase its appropriateness and records 
the corrected rule in said storage device. 

2. The intrinsic representation extraction rule generating system described in Claim 1 
characterized by the fact that said rule generating means performs the following operation: when 
a word contained in the word row information read from said storage device is a numeral or a 
proper noun, or when the word is neither the word at the tail of said word row information nor 
any of the functional words including symbols, single kanji, tail connecting words, head 
connecting words, and particles, said word is converted to a variable, and a word row 
information containing variables is determined, and said rule is generated on the basis of said 
word row information containing variables and on the basis of said correct answer list. 

3. An intrinsic representation extraction rule generating system characterized by the 
following facts: the intrinsic representation extraction rule generating system performs computer 
processing to generate the rule for use in extracting the intrinsic representations from a document 
on the basis of the document for training data in a storage device beforehand and a correct 
answer list that lists what is contained as the intrinsic representations (correct answer intrinsic 
representations) at what position in the document for training for extracting what type of intrinsic 
representations; and it has the following means: a word type/character type attaching means, 
which reads said document for training from said storage device and divides it into words, 
attaches word type and structural character type to each word, generates word row information 
that forms the intrinsic representation contained in said document for training and stores it in said 
storage device; and a rule generating means that performs the following operation: said word row 
information is read from said storage device; when a word contained in said word row 
information read from said storage device is a numeral or a proper noun, or when the word is 
neither the word at the tail of said word row information nor any of the functional words 
including symbols, single kanji, tail connecting words, head connecting words, and particles, 
said word is converted to a variable, and word row information containing variables is 
determined, and said rule is generated on the basis of said word row information containing 
variables and on the basis of said correct answer list. 

4. The intrinsic representation extraction rule generating system described in any of 
Claims 1-3 characterized by the fact that said rule generating means has a means that attaches to 
the generated rule a priority of the rule defined as the total number of rounds in which said 
intrinsic representation used in generating the rule appears in said correct answer list. 

5. A type of intrinsic representation extracting device characterized by the following 
facts: the intrinsic representation extracting device has the intrinsic representation extraction rule 
generating system described in any of Claims 1-4, and it can extract the intrinsic representations 
contained in any document by means of computer processing on the basis of the rule generated 
with said intrinsic representation extraction rule generating system; in this intrinsic 
representation extracting device, there is a means that performs the following operation: when 
there is a partial overlap between plural extracted candidate intrinsic representations, the 
candidate intrinsic representation having an earlier description start position in said any 
document is extracted with priority; if they have the same description start position, the 
candidate intrinsic representation having a later description end position is taken as priority in 
extraction; also, there is a means that performs the following operation: when the plural extracted 
candidate intrinsic representations are the same, the candidate intrinsic representation having a 
higher priority attached beforehand to said rule used in extracting said candidate intrinsic 
representation is taken as the priority in extraction. 

6. An intrinsic representation extraction rule generating method characterized by the 
following facts: in the intrinsic representation extraction rule generating method, computer 
processing is performed to generate the rule for use in extracting the intrinsic representations 
from a document on the basis of the document for training data in a storage device beforehand 
and a correct answer list that lists what is contained as the intrinsic representations (correct 
answer intrinsic representations) at what position in the document for training for extracting what 
type of intrinsic representations; and it has the following steps of operation: a first step in which 
said document for training is read from said storage device and is divided into words, a second 
step in which the word type and structural character type are attached to each divided word to 
generate a word row information that forms the intrinsic representation contained in said 
document for training; a third step in which the various correct answer intrinsic representations 
of said correct answer list are read from said storage device and are compared with the various 
word row information generated in said second step to generate the rule for extracting said 
correct answer intrinsic representation; a fourth step in which said document for training and said 
rules are read from said storage device, said rules are applied in said document for training, and 
the corresponding intrinsic representation (candidate intrinsic representation) is extracted and 
recorded in said storage device; a fifth step in which said candidate intrinsic representation and 
said correct answer intrinsic representation of said correct answer list are read from said storage 
device and they are compared with each other, and the appropriateness of each rule used in 
extracting each candidate intrinsic representation is computed on the basis of a prescribed 
computing sequence; a sixth step in which the rule with an appropriateness computed using said 
rule evaluating means lower than a prescribed appropriateness is deleted from said storage 
device; and a seventh step in which the rule having the appropriateness computed using said rule 
evaluating means within a prescribed appropriateness range is corrected so as to increase its 
appropriateness, and the corrected rule is recorded in said storage device. 

7. The intrinsic representation extraction rule generating method described in Claim 6 
characterized by the fact that said third step has the following steps of operation: a step in which 
the following operation is performed: when a word contained in the word row information read 
from said storage device is a numeral or a proper noun, or when the word is neither the word at 
the tail of said word row information nor any of the functional words including symbols, single 
kanji, tail connecting words, head connecting words, and particles, said word is converted to a 
variable, and a word row information containing variables is determined, and a step in which 
said rule is generated on the basis of said word row information containing variables and on the 
basis of said correct answer list. 

8. The intrinsic representation extraction rule generating method described in Claim 6 or 
7 characterized by the following facts: said fourth step has a step in which the description 
position information of said candidate intrinsic representations in said document for training and 
the identification information of rule used in extracting said intrinsic representations are attached 
to said candidate intrinsic representations; said fifth step has the following steps: a step in which 
said candidate intrinsic representations and said correct answer list are read from said storage 
device and are compared with each other, and said extracted candidate intrinsic representations 
are classified to candidate intrinsic representations (intermediate candidate intrinsic 
representations) that are not in said correct answer list yet have their output suppressed by the 
other correct answer intrinsic representations in said correct answer list, and candidate intrinsic 
representations (non-correct answer candidate intrinsic representations) that are not in said 
correct answer list and have their output not suppressed by the other correct answer intrinsic 
representations in said correct answer list, and a step in which for each rule used in extraction of 
the candidate intrinsic representations, the number of said correct answer intrinsic 
representations extracted with said rule and the number of said non-correct answer candidate 
intrinsic representations are counted; in said sixth step, the rule for which the number of said 
non-correct answer candidate intrinsic representations with respect to the number of said 
correct answer candidate intrinsic representations is over a prescribed standard T1 is deleted 
from the rule group generated in said fourth step; in said seventh step, the rule for which the 
number of said non-correct answer candidate intrinsic representations with respect to the number 
of said correct answer candidate intrinsic representations is lower than a prescribed standard 
T2 is corrected so that said number of the non-correct answer candidate intrinsic representations 
is reduced. 

9. The intrinsic representation extraction rule generating method described in any of 
Claims 6-8 characterized by the following facts: in said fifth step, plural candidate intrinsic 
representations are read from said storage device with the same rule, they are classified into 
candidate intrinsic representations (correct answer candidate intrinsic representations) that are in 
agreement with said correct answer intrinsic representations, candidate intrinsic 
representations (non-correct answer candidate intrinsic representations) that are not in agreement 
with said correct answer intrinsic representations, and candidate intrinsic representations 
(intermediate candidate intrinsic representations) that are not in agreement with said correct 
answer intrinsic representations yet have their output suppressed by other said correct answer 
candidate intrinsic representations, and computes said appropriateness of said correct answer 
candidate intrinsic representations and non-correct answer candidate intrinsic representations on 
the basis of their numbers; in said seventh step, for each candidate intrinsic representation 
extracted by applying said rule (original rule) with said appropriateness in the prescribed 
appropriateness range, in said document for training, the words before and after it as well as the 
character types and word type of the words are determined, and on the basis of said words before 
and after the candidate intrinsic representation as well as the character type and word type of the 
word, a restricting condition, which ensures that said non-correct answer candidate intrinsic 
representation contained in each said candidate intrinsic representation is not extracted, is 
generated and added to said original rule. 

10. An intrinsic representation extraction rule generating method characterized by the 
following facts: the intrinsic representation extraction rule generating method is adopted to 
perform computer processing to generate the rule for use in extracting the intrinsic 
representations from a document on the basis of the document for training data in a storage 
device beforehand and a correct answer list that lists what are contained as the intrinsic 
representations (correct answer intrinsic representations) at what position in the document for 
training for extracting what type of intrinsic representations; and it has the following steps of 
operation: a first step in which said document for training is read from said storage device and is 
divided into words; a second step in which the word type and structural character type are 
attached to each word to generate a word row information that forms the intrinsic representations 
contained in said document for training, and it is recorded in said storage device; and a third step 
in which the following operation is performed: when a word contained in the word row 
information read from said storage device is a numeral or a proper noun, or when the word is 
neither the word at the tail of said word row information nor any of the functional words 
including symbols, single kanji, tail connecting words, head connecting words, and particles, 
said word is converted to a variable, and a word row information containing variables is 
determined, and said rule is generated on the basis of said word row information containing 
variables and on the basis of said correct answer list. 

11. The intrinsic representation extraction rule generating method described in any of 
Claims 6-10 characterized by the fact that in said third step, a priority of the rule defined as the 
total number of rounds in which said intrinsic representation used in generating the rule appears 
in said correct answer list is attached to the generated rule. 

12. A type of recording medium characterized by the following facts: the recording 
medium is for recording a program in a computer readable manner, with said program describing 
the processing of the method for generating the rule for use in extracting the intrinsic 
representations from a document on the basis of a document for training data in a storage device 
beforehand and a correct answer list that lists what is contained as the intrinsic representations 
(correct answer intrinsic representations) at what position in the document for training for 
extracting types of intrinsic representations. 

Detailed explanation of the invention 
[0001] 

Technical field of the invention 

The present invention pertains to a technology for extracting the intrinsic representation 
contained in a document by means of a computer. Especially, the present invention pertains to an 
intrinsic representation extraction rule generating system and its method that can be used 
preferably in generating the rule for extracting the intrinsic representations at a high efficiency, 
as well as a type of recording medium for recording the processing program for said system and 
method, and a type of intrinsic representation extracting device. 

[0002] 
Prior art 

In order to answer inquiries regarding the information contained in a large document, to 
make a summary of the document, to form a data base of the document or to visualize the 
document, it is necessary to extract the intrinsic representations, such as personal names, 
addresses, institution names, date/time, etc., from the document. In this case, one can make use 
of a computer to prepare a glossary that has the various intrinsic representations registered in it, 
and, by searching the glossary, one can perform extraction of the intrinsic representations from 
the document. 

[0003] 

However, the actual document may contain new words that are not included in the 
glossary prepared beforehand. Consequently, searching in the glossary only may not give a 
correct extraction result. In order to cope with this problem, there is the following technology: 
plural rules that can regulate the appearing pattern of the order of the intrinsic representation 
itself and the words before and after it are prepared manually beforehand; on the basis of the 
rules, computer processing is performed to extract the intrinsic representation from the document 
as the object. 

[0004] 

However, in this technology, the rules compete with each other and interact with each 
other. Consequently, the rules may not work as intended. As a result, the prepared rule has to be 
applied on certain training data prepared beforehand, and, on the basis of the result, if any error 
is observed, the rule is corrected. This operation is repeated in several rounds. 

[0005] 

However, as a result of correction of a certain rule, the rules that used to operate normally 
may be affected, and erroneous answers may be given in many cases. Consequently, in order to 
have the plural rules all work as intended, a tremendous amount of labor is required. 

[0006] 

Even in the technology in which said rules for extracting the intrinsic representation are 
automatically generated using a computer, due to the competition and interaction between the 
rules, a combination of said automatically generated rules is required to realize a good result, and 
this rule should be applied again, and the results are compared with the correct answer to be 
assessed. On the basis of the results, rules are added or deleted so as to get better results in 
repeated trial-and-error operation. This, however, requires a long computing time. 

[0007] 

Problems to be solved by the present invention 

The problems to be solved are as follows: In the prior art, it is impossible to generate the 
rule for extracting the intrinsic representation contained in the document at a high precision, and 
in order to generate a better rule (rule for extracting the intrinsic representation), each time 
the combination of the rules is corrected, it must be applied on the practical document, the result 
compared with the correct answer and graded, and trial-and-error performed for the 
combination of the various rules. As a result, a huge computing time is needed, and this is 
undesirable. 

[0008] 

The purpose of the present invention is to solve the problems of the prior art by providing 
a type of an intrinsic representation extraction rule generating system and method that allow 
generation of high-precision intrinsic representation extracting rules easily in a short time and 
allow correct extraction of the desired intrinsic representations from a large document, as well as 
a recording medium that records the processing program and the intrinsic representation 
extracting device. 

[0009] 

Means to solve the problems 

In order to realize the aforementioned purpose, in the intrinsic representation extraction 
rule generating system and method of the present invention, first of all, a document for training 
prepared beforehand is subjected to morphological analysis and is divided into words, and the 
information regarding the word type and structural character type, etc. is attached to each word. 
From the word row obtained in this way, the word row that forms the intrinsic representation is 
fetched, and by taking reference to the correct answer list prepared beforehand corresponding to 
the document for training, plural rules for extracting intrinsic representations are generated by 
means of empirical rules, minimum generalization, and other generalization means. Then, these 
rules are applied independently to the document for training, and the data regarding where the 
position in the document for training matches the rule are stored. These data become candidates 
of the intrinsic representation output from the system with respect to the document for training. 
When plural rules are combined, from all of the candidates included in the data corresponding to 
said rules, the finally output candidate row is selected with a prescribed clear standard in 
consideration of the competitive relationship and the priority order. As a result, when a rule has a 
high frequency of non-correct answers or a very large proportion of said non-correct answers in 
the document for training, the rule is deleted. In this case, the word row before and after the 
correct answer site is compared with the word row before and after the non-correct answer site 
and a restriction is applied. As a result, it is possible to make a judgment on whether a rule with 
good results in the document for training is formed. Consequently, when the result is good, a rule 
with restriction applied on it is adopted. 
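
The comparison of the word rows around correct answer sites and non-correct answer sites can be sketched as follows. This is only a minimal illustration under stated assumptions: the `Match` record, the use of the single preceding word as the only context, and the acceptance test in `refine` are hypothetical and are not taken from the source, which also considers the following words as well as character types and word types.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Match:
    """One site where a rule matched the document for training (hypothetical record)."""
    prev_word: str    # word immediately before the matched span
    is_correct: bool  # True if the span agrees with the correct answer list


def propose_restriction(matches):
    """Return the preceding words that occur only at non-correct answer sites."""
    good_prev = {m.prev_word for m in matches if m.is_correct}
    bad_prev = {m.prev_word for m in matches if not m.is_correct}
    return bad_prev - good_prev


def apply_restriction(matches, banned_prev):
    """Keep only the matches whose preceding word is not banned."""
    return [m for m in matches if m.prev_word not in banned_prev]


def refine(matches):
    """Adopt the restriction only when it removes errors and loses no correct answer."""
    banned = propose_restriction(matches)
    kept = apply_restriction(matches, banned)
    wrong_before = sum(not m.is_correct for m in matches)
    wrong_after = sum(not m.is_correct for m in kept)
    correct_before = sum(m.is_correct for m in matches)
    correct_after = sum(m.is_correct for m in kept)
    improved = wrong_after < wrong_before and correct_after == correct_before
    return (banned, kept) if improved else (set(), matches)
```

In this sketch, a rule whose non-correct matches always follow a word that never precedes a correct match gains a restriction banning that word; otherwise the original rule is left unchanged.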

[0010] 

In addition, the intrinsic representation extracting device of the present invention has the 
intrinsic representation extraction rule generating system described above with the following 
features: it can extract the intrinsic representation in any document on the basis of the rule 
generated with said intrinsic representation extraction rule generating system. Also, when there is 
a partial overlap between plural extracted candidate intrinsic representations, the candidate 
intrinsic representation having an earlier description start position in said any document is 
extracted with priority; if they have the same description start position, the candidate intrinsic 
representation having a later description end position is taken as priority in extraction; also, when 
there are plural extracted candidate intrinsic representations with the same representation but of 
different types, the candidate intrinsic representation having a higher priority attached 
beforehand to said rule used in extracting said intrinsic representation is taken as the priority in 
extraction. 

[0011] 

Embodiments of the invention 

In the following, the embodiments of the present invention will be explained in detail 
with reference to figures. 

[0012] 

Figure 1 is a block diagram illustrating an example of the constitution of the intrinsic 
representation extraction rule generating system of the present invention and the intrinsic 
representation extracting device having said intrinsic representation extraction rule generating 
system set in it. Figure 2 is a block diagram illustrating an example of the hardware constitution 
of the intrinsic representation extraction rule generating system and the intrinsic representation 
extracting device shown in Figure 1. 



[0013] 

In Figure 2, (21) represents a display device made of CRT (cathode ray tube), LCD 
(liquid crystal display), etc.; (22) represents an input device made of a keyboard, a mouse, etc.; 
(23) represents an external storage device made of HDD (hard disk drive) or the like; (24) 
represents an information processing device having CPU (central processing unit) (24a), principal 
memory (24b), etc. and performing computer processing using the storage program system; (25) 
represents an optical disk made of CD-ROM (compact disk-read only memory) or DVD (digital 
video disk/digital versatile disk) or the like for recording the program and data pertaining to the 
present invention; (26) represents a driver for reading the program and data recorded on optical 
disk (25); and (27) represents a communication device made of LAN (local area network) card, 
modem, etc. 

[0014] 

After the program and data stored in optical disk (25) are installed in external storage 
device (23) via driver (26) by means of information processing device (24), they are read from 
external storage device (23) to principal memory (24b), and are processed with CPU (24a). In 
said information processing device (24), there are both the intrinsic representation extraction rule 
generating system and the intrinsic representation extracting device having said intrinsic 
representation extraction rule generating system shown in Figure 1. 

[0015] 

In the intrinsic representation extracting device shown in Figure 1, document for training 
(1), correct answer list (2), intrinsic representation extraction rule group (5), improved intrinsic 
representation extraction rule group (5a), training data (7), novel document (11), and list (13) of 
the extracted intrinsic representations are stored in external storage device (23) and principal 
memory (24b) shown in Figure 2. Also, morphological analysis/word type and character type 
attaching part (3), rule generating part (4), rule application part for training (6), rule evaluating 
part (8), rule deleting part (9), rule refining part (10), and rule application part for execution (12) 
are formed in information processing device (24) on the basis of the program stored in CD-ROM 
(25) shown in Figure 2. 

[0016] 

Said morphological analysis/word type and character type attaching part (3), rule 
generating part (4), rule application part for training (6), rule evaluating part (8), rule deleting 
part (9), and rule refining part (10) form the intrinsic representation extraction rule generating 
system of the present invention. 

[0017] 

In morphological analysis/word type and character type attaching part (3), document for 
training (1) is divided into words, and information regarding the word type and the structural 
character type is attached to each word. 
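
As a rough illustration of attaching a structural character type to each word, the following sketch assumes that word division and word types are already supplied by an external morphological analyzer (not shown). The label set (`kanji`, `hiragana`, `katakana`, `digit`, `latin`, `symbol`, `mixed`) and the function names are hypothetical; the source does not enumerate the character types it uses.

```python
import unicodedata


def char_type(ch):
    """Classify a single character by its Unicode name (assumed label set)."""
    if ch.isdigit():
        return "digit"
    name = unicodedata.name(ch, "")
    if name.startswith("CJK UNIFIED IDEOGRAPH"):
        return "kanji"
    if name.startswith("HIRAGANA"):
        return "hiragana"
    if name.startswith("KATAKANA"):
        return "katakana"
    if "LATIN" in name:
        return "latin"
    return "symbol"


def structural_char_type(word):
    """Return the single character type of a word, or 'mixed' if it has several."""
    types = {char_type(ch) for ch in word}
    return types.pop() if len(types) == 1 else "mixed"


def attach_types(tagged_words):
    """Extend (word, word_type) pairs to (word, word_type, character_type) triples."""
    return [(w, wtype, structural_char_type(w)) for w, wtype in tagged_words]
```

For instance, a word written only in kanji is labeled `kanji`, while a word combining kanji and hiragana is labeled `mixed`.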

[0018] 

In rule generating part (4), the word row obtained in the processing of morphological 
analysis/word type and character type attaching part (3) is compared with the data of the intrinsic 
representation to be extracted and given by correct answer list (2), and the word row that forms 
the intrinsic representation is fetched and generalized to generate a rule. The result is recorded as 
intrinsic representation extraction rule group (5) in external storage device (23) in Figure 2. 
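
The generalization condition stated in Claim 2 can be sketched as below. The word type labels, the `("VAR", ...)` / `("LIT", ...)` pattern encoding, and the function name are assumptions made for illustration only; the source does not specify how a rule is represented internally.

```python
# Word types treated as functional words in Claim 2 (labels are assumed).
FUNCTIONAL_TYPES = {
    "symbol",
    "single kanji",
    "tail connecting word",
    "head connecting word",
    "particle",
}
# Word types that are always converted to a variable.
ALWAYS_VARIABLE = {"numeral", "proper noun"}


def generalize(word_row):
    """Convert a word row [(surface, word_type), ...] into a pattern with variables."""
    last = len(word_row) - 1
    pattern = []
    for i, (surface, wtype) in enumerate(word_row):
        is_tail = i == last
        if wtype in ALWAYS_VARIABLE or (not is_tail and wtype not in FUNCTIONAL_TYPES):
            # The word becomes a variable that keeps only its word type.
            pattern.append(("VAR", wtype))
        else:
            # Tail words and functional words stay as literals.
            pattern.append(("LIT", surface))
    return pattern
```

Under these assumptions, the word row for "Tanaka Taro Prize Selecting Committee" keeps only the tail word "Committee" as a literal, so the resulting pattern can match other committees with different names.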

[0019] 

In rule application part for training (6), intrinsic representation extraction rule group (5) 
obtained as the result of generation of rule generating part (4) is applied in document for training 
(1). The result is recorded as data for training (7) in external storage device (23) in Figure 2. 

[0020] 

Rule evaluating part (8) evaluates the rules on the basis of data for training (7). On the 
basis of the evaluation result of rule evaluating part (8), rule deleting part (9) deletes the rule 
with poor results. Rule refining part (10) refines the rule so that the results become better. 

[0021] 

Rule application part for execution (12) applies the improved intrinsic representation 
extraction rule group (5) (improved intrinsic representation extraction rule group (5a)) on actual 
novel document (11) to obtain intrinsic representation list (13). 

[0022] 

For both rule application part for training (6) and rule application part for execution (12), 
the rule group is applied on the document to extract the intrinsic representation, and the 
processing contents are nearly the same. Consequently, it is possible to have both of them in the 
same device. Also, in rule application part for execution (12), there is no need to leave data for 
training (7). However, it is necessary to perform selection of the final candidate. This is a point 
of difference. 



[0023] 

First, an explanation will be given regarding the operation of rule application part for 
execution (12), that is, the operation as an intrinsic representation extracting device using 
intrinsic representation extraction rule group (5) generated and improved with the intrinsic 
representation extraction rule generating system and improved intrinsic representation extraction 
rule group (5a). 

[0024] 

Rule application part for execution (12) applies improved intrinsic representation 
extraction rule group (5a) for novel document (11) for which the intrinsic representation is to be 
extracted, and it extracts the intrinsic representations contained in the document and outputs 
intrinsic representation list (13). 

[0025] 

For example, suppose there is new document (11) "In Tanaka Taro Prize Selecting 
Committee...". The intrinsic representations in this document include name candidates of 
"Tanaka", "Taro", "Tanaka Taro", an object name candidate of "Tanaka Taro Prize", as well as 
an institution candidate of "Tanaka Taro Prize Selecting Committee". Usually, it is demanded 
that among said candidates, the longest one, that is, "Tanaka Taro Prize Selecting Committee", 
be extracted and output as the intrinsic representation. In this case, the other candidates (intrinsic 
representations) of "Tanaka" and "Taro" overlapped with said intrinsic representation should not 
be output. 



[0026] 

The relationship among the candidates can be reduced to the competition relationship due 
to overlap and the suppression relationship due to the priority sequence of the various candidates. 
That is, because "Tanaka Taro Prize Selecting Committee" overlaps "Tanaka" and other 
candidates, they compete with each other. It is possible to consider that as the long candidate 
"Tanaka Taro Prize Selecting Committee" has a high priority, the other shorter candidates are 
suppressed. 

[0027] 

In this example, in rule application part for execution (12), on the basis of said 
consideration, first of all, all of the rules are applied to the document, and a collection of all of 
the candidate intrinsic representations (including "Tanaka", "Taro", "Tanaka Taro", "Tanaka 
Taro Prize", "Tanaka Taro Prize Selecting Committee", etc.) is determined. Then, among said 
candidates, the longest candidate ("Tanaka Taro Prize Selecting Committee" among said 
candidates) of those having the same intrinsic representation ("Tanaka" in said candidates) is 
output. 

[0028] 

As one candidate is output, all of the other candidates that compete with it ("Tanaka", 
"Tanaka Taro", "Tanaka Taro Prize") are deleted from the collection of candidates. This 
operation is repeated until the collection of candidates becomes empty. In this way, 
intrinsic representation list (13) is obtained. 

[0029] 

However, when only the length is taken into consideration, it is difficult to select among 
competing candidates when plural candidates have the same length. For example, "Whitehouse" 
may be taken as an address or as an institution name. Consequently, the same character row 
"Whitehouse" becomes both a candidate for an address and a candidate for an institution. 

[0030] 

In this case, a priority order for extraction is set for the two candidates. For example, when 
the words before and after it are taken into consideration, in "In the park near Whitehouse..." 
there is a high probability that it is an address. On the other hand, in "According to 
Whitehouse,..." it is quite possibly an institution name. Also, when the appearance frequency is 
taken into consideration, if "Whitehouse" appears only once as an address in document for 
training (1) but appears 20 times as an institution name, there is a high probability that it 
should be judged to be an institution name. 

[0031] 

In this example, a priority with said conditions taken into consideration is attached to 
each rule in improved intrinsic representation extraction rule group (5a). 

[0032] 

Rule application part for execution (12) combines such priority with said length of the 
intrinsic representation, and computes the priority order for each candidate. It is believed that 
there are various options in setting the priority order. However, as explained above, among those 
having the earliest start position and among those having the latest end position, it is clear that 
the candidate having the highest priority should be selected. That is, for the priority relationship 
of the candidates, the following definition is the basis. 

[0033] 

[1] If the start position of candidate A is earlier than that of candidate B (that is, a smaller 
numeral), candidate A has the priority. 

[2] If the start position of candidate A is the same as that of candidate B, the candidate 
having the later end position (that is, a larger numeral) has the priority. 

[3] When two candidates have the same start position and the same end position, the 
candidate having a larger priority u given according to the rule beforehand is taken as having the 
priority. 
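
The priority definition [1] to [3], together with the output-and-delete loop described above, can be sketched as follows. This is only an illustrative sketch, not the implementation of the specification: the `Candidate` class, the field names, and the character positions in the example are assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    start: int     # position of the first character
    end: int       # position of the last character (inclusive)
    kind: str      # type of intrinsic representation
    priority: int  # priority u attached to the rule beforehand


def sort_key(c):
    # [1] earlier start first, [2] later end first, [3] larger u first
    return (c.start, -c.end, -c.priority)


def overlaps(a, b):
    # Two candidates compete when their character ranges intersect.
    return a.start <= b.end and b.start <= a.end


def select(candidates):
    """Repeatedly output the highest-priority candidate and delete its competitors."""
    remaining = sorted(candidates, key=sort_key)
    output = []
    while remaining:
        best = remaining[0]
        output.append(best)
        remaining = [c for c in remaining[1:] if not overlaps(best, c)]
    return output


# Hypothetical positions for "Tanaka Taro Prize Selecting Committee" and "Whitehouse".
example = [
    Candidate(0, 5, "person name", 1),        # Tanaka
    Candidate(7, 10, "person name", 1),       # Taro
    Candidate(0, 10, "person name", 1),       # Tanaka Taro
    Candidate(0, 16, "object name", 1),       # Tanaka Taro Prize
    Candidate(0, 36, "institution name", 1),  # ... Selecting Committee
    Candidate(50, 59, "address", 1),          # Whitehouse as an address
    Candidate(50, 59, "institution name", 20),
]
result = select(example)
```

With these assumed inputs, `result` contains only the committee candidate and the institution reading of "Whitehouse"; every shorter or lower-priority competitor is deleted.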

[0034] 

In the intrinsic representation extraction rule generating system of this example, intrinsic 
representation extraction rule group (5) that allows easy processing with said rule application 
part for execution (12) and improved intrinsic representation extraction rule group (5a) are 
generated. In the following, an explanation will be given regarding the operations of the various 
parts that form the intrinsic representation extraction rule generating system pertaining to the 
generation processing of the rules with said priority relationship taken into consideration. 

[0035] 

First of all, morphological analysis/word type and character type attaching part (3), which has 
a typical morphological analysis function, divides a document, such as document for training (1) 
or new document (11), into words. The word type and the type of the characters that form the 
word (structural character type information) are attached to each word to form a data structure, 
and a list of these data structures is formed. 

[0036] 

For example, in the sentence "for Nakano, president of Tokyo Steel....", the results of 
morphological analysis indicate that "Tokyo" is a unique noun; "Steel" is an ordinary noun; "of" 
is a particle; "Nakano" is a unique noun; "president" is an ordinary noun; and "for" is a particle. 

[0037] 

Also, "Tokyo" is composed of plural kanji characters, and "NO [of]" is a Japanese 
character. Consequently, morphological analysis/word type and character type attaching part (3) 
outputs a list with the following data structure for said sentence: "(Tokyo, plural kanji 
characters, unique noun), (Steel, plural kanji characters, ordinary noun), (of, Japanese 
character, particle),...". 
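
The attachment of the structural character type can be sketched as below. This is a minimal sketch under stated assumptions: the words are taken as already segmented and tagged with a word type by some morphological analyzer, the Japanese strings are an assumed original for "Tokyo Steel of", and the function names `char_type` and `attach` are hypothetical.

```python
def char_type(word):
    """Classify a word by the kind of characters it is made of (illustrative labels)."""
    if not word:
        return "other"
    if all("\u4e00" <= ch <= "\u9fff" for ch in word):   # CJK unified ideographs
        return "plural kanji characters" if len(word) > 1 else "single kanji character"
    if all("\u3040" <= ch <= "\u30ff" for ch in word):   # hiragana and katakana
        return "Japanese character"
    if all(ch.isdigit() for ch in word):
        return "numeral"
    return "other"


def attach(tagged_words):
    """Turn (word, word type) pairs into (word, character type, word type) triples."""
    return [(w, char_type(w), pos) for w, pos in tagged_words]


# Assumed segmentation of the Japanese text corresponding to "Tokyo Steel of".
words = attach([("東京", "unique noun"), ("製鉄", "ordinary noun"), ("の", "particle")])
```

Here `words` has the same shape as the list quoted above, one triple per word.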



[0038] 

On the other hand, correct answer list (2) lists the type of the intrinsic representation and 
the position in document for training (1). For example, correct answer list (2) prepared 
beforehand corresponding to document for training (1), "for Nakano, president of Tokyo 
Steel,. . .", is composed of the following data. 



[0039] 



Start  End  Intrinsic representation  Type 
0      3    Tokyo Steel               Institution name 
5      6    Nakano                    Person name 
20     23   March 9                   Date 
30     32   Okayama                   Place name 



[0040] 

In this list, the first line shows that "Tokyo Steel" of type "institution name" is present as 
an intrinsic representation "at the position from the 0th character to the 3rd character". 
The next line shows that "Nakano" of type "person name" is present as an intrinsic 
representation "at the position from the 5th character to the 6th character". In correct answer 
list (2) of this example, the pair of numerals indicates the start position and end position of 
each intrinsic representation, and it is followed by a brief name indicating the type of the 
corresponding intrinsic representation. 

[0041] 

In rule generating part (4), said correct answer list (2) is compared with the word row 
output from morphological analysis/word type and character type attaching part (3), and it 
converts the intrinsic representations into variables. As a result, for example, the following rule 
for extracting the intrinsic representation is generated. 

[0042] 

anytag(3) <- <@(institution name, 21), word(_, plural kanji characters, unique noun), 
word(Steel, plural kanji characters, ordinary noun), >@(institution name). 
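
One way to read a rule of this form is as a pattern over consecutive (word, character type, word type) triples. The sketch below is an illustration only: the dictionary layout, the function names, the use of word indices instead of character positions, and the character types given for "Nakano" and "president" are assumptions.

```python
# Rule number 21 from the text; None plays the role of the variable "_".
RULE_21 = {
    "number": 21,
    "priority": 3,
    "type": "institution name",
    "pattern": [
        (None, "plural kanji characters", "unique noun"),
        ("Steel", "plural kanji characters", "ordinary noun"),
    ],
}


def matches_at(pattern, words, i):
    """True when the pattern matches the word list starting at word index i."""
    if i + len(pattern) > len(words):
        return False
    for (pw, pc, pp), (w, c, p) in zip(pattern, words[i:]):
        if pw is not None and pw != w:
            return False
        if pc != c or pp != p:
            return False
    return True


def apply_rule(rule, words):
    """Return (first word index, last word index, type, rule number) for every match."""
    n = len(rule["pattern"])
    return [
        (i, i + n - 1, rule["type"], rule["number"])
        for i in range(len(words))
        if matches_at(rule["pattern"], words, i)
    ]


words = [
    ("Tokyo", "plural kanji characters", "unique noun"),
    ("Steel", "plural kanji characters", "ordinary noun"),
    ("of", "Japanese character", "particle"),
    ("Nakano", "plural kanji characters", "unique noun"),
    ("president", "plural kanji characters", "ordinary noun"),
]
candidates = apply_rule(RULE_21, words)
```

Only "Tokyo Steel" matches; "Nakano president" has the same character and word types but fails on the fixed word "Steel".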

[0043] 

In this rule, which is numbered "21", if there is any kanji unique noun (expressed in variable 
form as "word(_, plural kanji characters, unique noun)"), and the next word, "Steel", is an 
ordinary noun of plural kanji characters ("word(Steel, plural kanji characters, ordinary 
noun)"), these two words are taken as a candidate of the intrinsic representation of type 
"institution name". 

[0044] 

More generally, generation of said rule can be represented as follows. First of all, the 
intrinsic representation is composed of N+1 words [(w0, c0, p0), ..., (wi, ci, pi), ..., 
(wN, cN, pN)]. Here, wi represents the word ("Steel", "Nakano", etc.), ci 
represents the structural character type ("plural kanji characters", "numeral", etc.), and pi 
represents the word type ("unique noun", "ordinary noun", etc.). 

[0045] 

In practice, the several surrounding words are also an important means in judging 
whether [the representation] is an intrinsic representation. Consequently, they are usually taken 
into consideration as well. However, in this specification, in order to simplify the discussion, 
only the words contained in the intrinsic representation are taken into consideration. 

[0046] 

Then, from the word row, minimum generalization or other existing generalization 
technology is used to generate the rule. However, in the present example, generation is 
performed in a simple way as follows. 

[0047] 

That is, the empirical rule to be explained later is applied to the specific word row 
[(w0, c0, p0), ..., (wi, ci, pi), ..., (wN, cN, pN)] that forms the intrinsic 
representation contained in document for training (1) to form a list 
[(w0', c0', p0'), ..., (wi', ci', pi'), ..., (wN', cN', pN')] containing 
variables, and the following rule is formed. 

[0048] 

anytag(u) <- <@(t+df, k), word(w0', c0', p0'), ..., word(wi', ci', pi'), ..., 
word(wN', cN', pN'), >@(t-dt). 

[0049] 

Here, "t" indicates the type of the intrinsic representation (such as "institution name"). 
"+df" indicates how many characters the start position of the intrinsic representation should be 
shifted to the right, and it is a non-negative integer smaller than the number of characters of 
the initial word. Also, "-dt" indicates how many characters the end position of the intrinsic 
representation should be shifted to the left, and it is a non-negative integer smaller than the 
number of characters of the last word. 

[0050] 

For example, suppose document for training (1) reads "in Atsugi-shi,...". Although 
"Atsugi-shi" in it is a place name according to correct answer list (2), when morphological 
analysis/word type and character type attaching part (3) divides it into the words "Atsugi", 
"shi", "in", the word row that forms the intrinsic representation becomes "(Atsugi, plural kanji 
characters, unique noun), (shi, plural kanji characters, ordinary noun)", and the final one 
character ("in") is redundant. Here, in order to shift the end position by one character to the 
left, one has "dt=1". Also, because there is no shift for the start position, one has "df=0". 
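
The offsets "df" and "dt" can be derived from the word boundaries and the span recorded in the correct answer list. The following is a hedged sketch: the function name `boundary_offsets` and the character spans in the example are hypothetical, chosen so that the last word extends one character beyond the place name as in the case just described.

```python
def boundary_offsets(word_spans, ne_start, ne_end):
    """Find the words covering an intrinsic representation and the offsets df and dt.

    word_spans: (start, end) character positions of consecutive words, end inclusive.
    ne_start, ne_end: character span of the intrinsic representation, end inclusive.
    Returns (index of first word, index of last word, df, dt).
    """
    first = next(i for i, (s, e) in enumerate(word_spans) if s <= ne_start <= e)
    last = next(i for i, (s, e) in enumerate(word_spans) if s <= ne_end <= e)
    df = ne_start - word_spans[first][0]  # shift the start position to the right
    dt = word_spans[last][1] - ne_end     # shift the end position to the left
    return first, last, df, dt


# Assumed spans: two words covering characters 0-1 and 2-3, place name at 0-2.
offsets = boundary_offsets([(0, 1), (2, 3)], 0, 2)
```

By construction, "df" is smaller than the length of the first word and "dt" is smaller than the length of the last word, as required above.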



[0051] 

Also, in said rule, "k" is a number attached to said rule, and "u" represents the priority of 
the rule. 

[0052] 

Data (wi', ci', pi') containing various variables is obtained as follows: 
for the data (wi, ci, pi) corresponding to the specific intrinsic representation 
contained in document for training (1), the following empirical rules are examined sequentially 
from the top, and the first one that matches is adopted. 

[0053] 

[1] When "i" is "0" or "N", and the boundary of the intrinsic representation falls inside the 
word (df > 0 or dt > 0), the word is not converted to a variable. In this case, the original 
values of "df" and "dt" with respect to the original intrinsic representation are used in the 
rule as they are. 

[2] For a numeral, "wi" is converted to a variable. 

[3] For a unique noun, "wi" is converted to a variable. 

[4] If the word is the last word of the list or a functional word, such as symbol, single 
kanji character, tail connecting word, head connecting word, particle, etc., no conversion to 
variable is performed. 

[5] In other cases, "wi" is converted to variable. 
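
The empirical rules [1] to [5] can be sketched as a first-match decision per word. This is one reading only: it assumes that [1] concerns the first word when df > 0 and the last word when dt > 0, and that functional words are recognizable from the word type or character type labels shown here. The names `generalize`, `FUNCTION_TYPES`, and `VAR` are hypothetical.

```python
FUNCTION_TYPES = {"symbol", "tail connecting word", "head connecting word", "particle"}
VAR = "_"  # marks a word converted to a variable


def generalize(words, df, dt):
    """Apply the empirical rules in order; the first rule that matches decides."""
    n = len(words) - 1
    out = []
    for i, (w, c, p) in enumerate(words):
        if (i == 0 and df > 0) or (i == n and dt > 0):
            keep = True    # [1] word contains the boundary of the representation
        elif c == "numeral":
            keep = False   # [2] numerals become variables
        elif p == "unique noun":
            keep = False   # [3] unique nouns become variables
        elif i == n or p in FUNCTION_TYPES or c == "single kanji character":
            keep = True    # [4] last word or functional word stays as is
        else:
            keep = False   # [5] everything else becomes a variable
        out.append((w if keep else VAR, c, p))
    return out


pattern = generalize(
    [("Tokyo", "plural kanji characters", "unique noun"),
     ("Steel", "plural kanji characters", "ordinary noun")],
    df=0, dt=0,
)
```

For "Tokyo Steel" this yields the pattern of rule 21 quoted earlier: the unique noun becomes a variable and the final word "Steel" is kept.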

[0054] 

By applying the aforementioned processing for the various intrinsic representations, it is 
possible to automatically generate intrinsic representation extraction rule group (5). 

[0055] 

Also, as priority (u) of each rule, for example, the total number of times that the intrinsic 
representation from which the rule originates appears in the correct answer list is adopted. 
As a result, it is possible to avoid the problem that a rule with a smaller number of correct 
answers ("Whitehouse" as a place name in said example) suppresses, without any justified reason, 
a rule having a larger number of correct answers ("Whitehouse" as an institution name). 
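
Counting appearances in the correct answer list to obtain the priority can be sketched as below. The tuple layout (start, end, text, type), the function name, and the positions in the example are assumptions made for illustration.

```python
from collections import Counter


def rule_priorities(correct_answers):
    """Count how often each (text, type) pair appears in the correct answer list."""
    return Counter((text, kind) for _start, _end, text, kind in correct_answers)


# Assumed list: "Whitehouse" once as a place name and 20 times as an institution name.
answers = [(0, 9, "Whitehouse", "place name")]
answers += [(100 * i, 100 * i + 9, "Whitehouse", "institution name") for i in range(1, 21)]
u = rule_priorities(answers)
```

With these counts, the rule derived from the institution reading receives the larger priority and is not suppressed by the place name rule.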

[0056] 

Rule application part for training (6) applies the various rules (intrinsic representation 
extraction rule group (5)) obtained with said rule generating part (4) to the word row of 
document for training (1) to obtain training data (7). That is, in rule application part for 
training (6), the positions where the rules match are examined sequentially from the head to the 
tail of document for training (1). Each match is taken as a candidate and is added to training 
data (7). This operation is repeated. 

[0057] 

For training data (7), more specifically, comparison is performed for the competition 
relationship and suppressing relationship between the various candidates, and the data of rule 
number (k), matched position, type of the intrinsic representation (t), etc. are recorded such that 
the final output can be obtained. 

[0058] 

The processing with said rule application part for training (6) is performed for all of the 
rules of intrinsic representation extraction rule group (5) to form training data (7). 

[0059] 

Also, by means of a bottom-up type text analysis scheme, it is possible to simultaneously 
obtain plural rule application results at a high efficiency. 

[0060] 

Rule evaluating part (8) reads training data (7) prepared as described above and grades the 
result of each rule. Various standards may be adopted for grading. A simple way is to evaluate 
each rule by the number and the proportion of its non-correct answers. However, strictly 
speaking, the number of non-correct answers for each rule depends on the rules combined with it. 
Consequently, as long as the specific rules to be adopted have not yet been decided, it is 
impossible to obtain an exact number. For this reason, the records of each rule (R) are 
classified as follows for consideration. 

[0061] 

(O) The candidate obtained by matching with the intrinsic representation that is the base of 
rule R, that is, the candidate that surely becomes a correct answer if it is not suppressed by 
another candidate (correct answer candidate intrinsic representation). 

(A) Another competing intrinsic representation is registered in correct answer list (2), 
and the candidate is suppressed by it. That is, although the candidate is not a correct answer, 
its output is suppressed, so that in a rule group with a high precision there is a high 
possibility that the candidate does not degrade the result (intermediate candidate intrinsic 
representation). 

(x) The others. That is, because there is no correct answer intrinsic representation that 
suppresses the candidate, even in a high-precision rule group there is a high possibility that a 
wrong candidate is output and the result is degraded (non-correct answer candidate intrinsic 
representation). 

[0062] 

In rule evaluating part (8), the number of occurrences of each of "O", "A" and "x" is counted 
for each rule; the count of "x" is adopted as the number of non-correct answers, and the count 
of "O" is adopted as the number of correct answers. If all of "A" were taken as non-correct 
answers, a rule that extracts "Tanaka" or another short intrinsic representation would be 
treated unfavorably, so this should be avoided. For this purpose, in rule evaluating part (8), 
the following method is adopted to count the non-correct answers. 

[0063] 

That is, rule evaluating part (8) reads training data (7) sequentially from the beginning. 
Suppose rule R is applied at location L of document for training (1), and the type of the 
intrinsic representation attached with rule R (classification of place name, person name, etc.) 
is T. If the pair of type T and location L is not contained as a correct answer in correct 
answer list (2) and, in addition, either no correct answer intrinsic representation is present at 
a position overlapping location L, or one is present but the candidate according to rule R takes 
priority over the candidate corresponding to the correct answer, the number of non-correct 
answers of rule R is increased by one. This operation is repeated until the end of training 
data (7). 
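
The counting procedure can be sketched as follows. It is a sketch under assumptions: candidates and correct answers are both represented as spans carrying a rule priority, "takes priority" is read as the ordering by start position, end position and priority defined earlier, and a candidate overlapping several correct answers is counted only if it takes priority over all of them. The names `Span`, `Record`, and `count_non_correct` are hypothetical.

```python
from collections import Counter, namedtuple

Span = namedtuple("Span", "start end kind priority")  # end is inclusive
Record = namedtuple("Record", "rule span")            # one entry of the training data


def rank(s):
    # Smaller tuple means higher priority: earlier start, later end, larger u.
    return (s.start, -s.end, -s.priority)


def overlaps(a, b):
    return a.start <= b.end and b.start <= a.end


def count_non_correct(training_data, correct_answers):
    """Count, per rule number, the candidates that would be output wrongly."""
    correct_keys = {(a.start, a.end, a.kind) for a in correct_answers}
    counts = Counter()
    for rec in training_data:
        c = rec.span
        if (c.start, c.end, c.kind) in correct_keys:
            continue  # type T at location L is a correct answer
        rivals = [a for a in correct_answers if overlaps(a, c)]
        # No overlapping correct answer, or the candidate outranks each of them.
        if all(rank(c) < rank(a) for a in rivals):
            counts[rec.rule] += 1
    return counts


correct = [Span(0, 36, "institution name", 1)]
data = [
    Record(1, Span(0, 5, "person name", 5)),        # suppressed by the longer answer
    Record(2, Span(40, 45, "person name", 1)),      # nothing suppresses it
    Record(3, Span(0, 36, "institution name", 1)),  # the correct answer itself
]
non_correct = count_non_correct(data, correct)
```

In this assumed example only rule 2 receives a non-correct count; the short candidate of rule 1 is not penalized because the longer correct answer suppresses it.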

[0064] 

Rule evaluating part (8) counts the numbers of "O", "A", "x" of each rule. With respect to 
this result, rule deleting part (9) and rule refining part (10) apply correction on intrinsic 
representation extraction rule group (5). 

[0065] 

Among the rules of intrinsic representation extraction rule group (5), rule deleting part (9) 
deletes, for example, the rules for which the number of "x" is larger than that of "O". Rule 
refining part (10) performs the following operation: among the rules of intrinsic representation 
extraction rule group (5), for example, restriction information pertaining to the words before 
and after the intrinsic representation is added to the rules for which the number of "x" is 
larger than half the number of "O", so as to improve the results of the rules. 
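
The example thresholds for deleting and refining can be written as a small triage step. This is an illustrative sketch; the function name `triage` and the sample counts are assumptions, and the thresholds are only the ones given above as examples.

```python
def triage(stats):
    """Sort rules into keep / refine / delete from their (number of O, number of x)."""
    result = {"keep": [], "refine": [], "delete": []}
    for rule, (n_o, n_x) in stats.items():
        if n_x > n_o:
            result["delete"].append(rule)   # more x than O
        elif n_x > n_o / 2:
            result["refine"].append(rule)   # x exceeds half of O
        else:
            result["keep"].append(rule)
    return result


# Assumed counts for three rules: rule number -> (number of O, number of x).
decision = triage({1: (10, 2), 2: (10, 6), 3: (3, 5)})
```

The deletion test is applied first, so a rule that satisfies both conditions is deleted rather than refined.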

[0066] 

For example, suppose two words before the intrinsic representation and two words after 
the intrinsic representation are included for consideration. For each intrinsic representation 
extracted with said rule and classified by evaluation as "O" or "x", the word list 

[(w-2, c-2, p-2), (w-1, c-1, p-1), (w0, c0, p0), ..., (wN, cN, pN), (wN+1, cN+1, pN+1), 
(wN+2, cN+2, pN+2)] 

is considered. In this case, for each intrinsic representation, the characteristic list 

[w-2, c-2, p-2, w-1, c-1, p-1, wN+1, cN+1, pN+1, wN+2, cN+2, pN+2] 

is considered. If the intrinsic representations classified as "O" are taken as positive cases 
and those classified as "x" as negative cases, this is a typical problem of inductive learning, 
and an existing machine learning scheme can be used as is. 

[0067] 

For example, by means of learning using a decision tree, it is possible to determine, among the 
several words before and after the [intrinsic representation], which property of which word 
should keep its value, while the remainder is converted to variables. As a special example, 
suppose "10" intrinsic representations classified as "x" are extracted, and "8" of them have 
"wx" as the preceding word (w-1). Then, as shown below, a restrictive condition "w-1' ≠ wx" is 
applied to the original rule, so that an intrinsic representation having "wx" as the preceding 
word (w-1) is not extracted. 



[0068] 

anytag(u) <- word(w-1', c-1', p-1'), <@(t+df, k), word(w0', c0', p0'), ..., 
word(wi', ci', pi'), ..., word(wN', cN', pN'), >@(t-dt), {w-1' ≠ wx}. 
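
The special case above, in which most "x" candidates share the same preceding word, can be sketched without a full decision tree learner. This is a simplification offered only as an illustration: the function name, the 0.8 share threshold, and the extra check against the "O" cases are assumptions, not part of the specification.

```python
from collections import Counter


def preceding_word_restriction(neg_prev_words, pos_prev_words, min_share=0.8):
    """Return a preceding word to forbid, or None if no single word dominates.

    neg_prev_words: preceding word (w-1) of every candidate classified as "x".
    pos_prev_words: preceding word (w-1) of every candidate classified as "O".
    """
    if not neg_prev_words:
        return None
    word, count = Counter(neg_prev_words).most_common(1)[0]
    if count / len(neg_prev_words) >= min_share and word not in pos_prev_words:
        return word  # adding the condition w-1' != word removes most "x" cases
    return None


# 8 of 10 "x" candidates are preceded by the (placeholder) word "wx".
forbidden = preceding_word_restriction(["wx"] * 8 + ["a", "b"], ["c", "d"])
```

A word that also precedes "O" candidates is not forbidden here, since the restriction would then remove correct answers as well.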



[0069] 

The rule obtained in this way is more restrictive than the original rule. 
Consequently, it matches only portions that also match the original rule. 
Therefore, even when the new rule is not applied to the entirety of document for training (1), 
as long as it is applied to the portions matched with the original rule that are left in 
training data (7), the results of the new rule can be understood. 

[0070] 

In this example, improvement of the rule is performed almost independent from other 
rules. As explained above, rules with better results (improved intrinsic representation extraction 
rule group (5a)) are generated from the original rules (intrinsic representation extraction rule 
group (5)). 

[0071] 

Figure 3 is a flow chart illustrating an example of the processing process of the intrinsic 
representation extraction rule generating method pertaining to the present invention. 

[0072] 

In this example, in the intrinsic representation extraction rule generating system shown in 
Figure 1, the various processing operations of morphological analysis/word type and character 
type attaching part (3), rule generating part (4), rule application part for training (6), and rule 
evaluating part (8) are shown. First of all, in morphological analysis/word type and character 
type attaching part (3), document for training (1) is subject to morphological analysis, and it is 
divided into words (step (301)), and the information of the word type and character type, etc. is 
attached to each word (step (302)). 

[0073] 

Then, in rule generating part (4), the intrinsic representation of correct answer list (2) and 
the word row composed of the words near it are extracted (step (303)), the empirical rule or the 
like is applied to the correct answer word row to generate extracting rules (step (304)), and 
they are recorded as intrinsic representation extraction rule group (5). 

[0074] 

In rule application part for training (6), the extracting rules generated in this way are 
applied to document for training (1), and the intrinsic representation obtained as a result is 
recorded as a candidate (step (305)). 

[0075] 

In addition, in rule evaluating part (8), the correct answer degrees (O, A, x) of the 
intrinsic representations extracted with the various extracting rules are determined and classified. 
On the basis of said operation, the appropriateness of each extracting rule is assessed (step (306)). 

[0076] 

As the result of the grading, the rules with results so poor that they cannot be corrected 
(with a low appropriateness) are deleted in rule deleting part (9) (step (307)). Also, in rule 
refining part (10), said correction is applied to the rules whose appropriateness can be raised 
by correction to form new rules (step (308)), and they are recorded as improved intrinsic 
representation extraction rule group (5a). By repeating the processing from step (305), it is 
possible to generate a rule group with better results. 

[0077] 

Figure 4 is a flow chart illustrating an example of the processing operation of the intrinsic 
representation extracting device shown in Figure 1. This example shows the processing operation 
of the intrinsic representation extracting device shown in Figure 1 for new document (11). 
First of all, in morphological analysis/word type and character type attaching part (3), new 
document (11) is subjected to morphological analysis and is divided into words (step (401)), and 
the information of the word type and character type, etc. is attached to each word of the word 
list (step (402)). 

[0078] 

Then, in rule application part for execution (12), the various extraction rules of improved 
intrinsic representation extraction rule group (5a) are applied to each word list, and the 
various intrinsic representations obtained are listed as candidates (step (403)). For all of 
the candidates, the following priority control processing is performed (step (404)). That is, 
the candidate with the highest priority among the candidates is output (step (405)), and the 
candidates that compete with said output candidate are deleted (step (406)). 

[0079] 

In the aforementioned intrinsic representation extraction rule generating system and 
method explained with reference to Figures 1-4, first of all, morphological analysis is performed 
on document for training (1) prepared beforehand so that it is divided into words; the 
information of the word type and structural character type, etc. is attached to each word; from 
the obtained words, the word row that forms the intrinsic representation is fetched; and, by 
means of the empirical rule, minimum generalization, or other generalizing means with reference 
to correct answer list (2) prepared beforehand corresponding to document for training (1), 
plural intrinsic representation extracting rules are generated. 

[0080] 

Then, the extracting rules are independently applied on document for training (1), and 
data indicating the position of document for training (1) where the rules match is prepared. This 
data represents the candidates of the intrinsic representation output from the system with respect 
to document for training (1). 

[0081] 

Then, when plural rules are combined, from all of the candidates that enter the records 
corresponding to the rules, the row of candidates to be finally output is selected with a 
prescribed clear standard in consideration of the competition relationship and the priority 
order. As a result, a rule that has a very high frequency of non-correct answers or a very high 
proportion of non-correct answers in document for training (1) is deleted. For a rule that is 
known to give a correct answer at certain positions of the document for training and a 
non-correct answer at certain other positions, a restriction is formed by comparing the word row 
before and after the correct answer sites with that before and after the non-correct answer 
sites, and it is judged whether the restricted rule has a good result in the document for 
training. When the results are good, the rule with the restriction on it is adopted. 

[0082] 

In this example, when a document for training containing the intrinsic representations and 
a correct answer list that lists what type of intrinsic representation is at what position in 
the document are given, the system can generate the intrinsic representation extracting rules on 
the basis of the correct answers. There is no need to write the extracting rules by hand, thus 
saving a great deal of labor. 

[0083] 

In addition, evaluation is performed for the various rules output with respect to document 
for training (1) prepared beforehand. Then, the evaluation value is determined for the various 
combinations of plural rules by means of simple computation from the evaluation values of the 
various individual rules. As a result, it is possible to shorten the processing time needed for 
trial-and-error performed during the process of determination of the combination of rules with 
good results. Also, improvement of the intrinsic representation extracting rules is performed 
almost independent from other rules, and the precision can be improved easily. 

[0084] 

Also, in the intrinsic representation extracting device of this example, the rules generated 
and improved on the basis of the document for training and the correct answer list are applied on 
new document (11), and the intrinsic representations are automatically extracted from said new 
document (11). At the same time, if the extracted plural intrinsic representations are partially 
overlapped with each other, the intrinsic representation having an earlier description start 
position in said document is extracted with priority; if they have the same description start 
position, the intrinsic representation having a later description end position is taken as priority in 
extraction; also, when there are plural types of intrinsic representations having the same 
representation, the intrinsic representation having a larger priority attached beforehand to said 
rule used in extracting said intrinsic representation is taken as the priority in extraction. As a 
result, it is possible to perform extraction limited only to the appropriate intrinsic representation. 

[0085] 

The present invention is not limited to the example explained with reference to Figures 1- 
4. Various modifications can be made as long as the gist is observed. For example, in this 
example, when a restriction is attached to the rules, the restriction is set on the basis of the words 
before and after the candidate intrinsic representation in the document for training. However, it is 
also possible to set a restriction pertaining to the character type of the word (kanji character, 
Japanese character, . . .) and word type (noun, verb, . . .), etc. 

[0086] 

Also, in this example, optical disk (25) is used as the recording medium. However, one 
may also adopt FD as the recording medium. In addition, as far as installing of the program is 
concerned, it is also possible to go through communication device (27) to download the program 
via a network and then install it. 

[0087] 

Effect of the invention 

According to the present invention, the rules for extracting the intrinsic representations 
are automatically generated on the basis of the document for training prepared beforehand and 
the correct answer list that lists what type of intrinsic representations are included at what 
positions in the document. Consequently, there is no need to write down the labor-intensive 
extracting rules. In addition, by comparing the result of application of the automatically 
generated rules on the document for training with the correct answer list, it is possible to 
determine the appropriateness of each rule and to determine the appropriateness of a combination 
of various rules on the basis of the appropriateness of each rule. Consequently, improvement of 
the intrinsic representation extracting rule can be performed almost independent from the other 
rules, it is possible to improve the precision easily, and it is possible to realize a 
high-performance intrinsic representation extracting device. 

Brief description of the figures 

Figure 1 is a block diagram illustrating an example of the constitution of the intrinsic 
representation extraction rule generating system and the intrinsic representation extracting device 
having said intrinsic representation extraction rule generating system set in it in the present 
invention. 

Figure 2 is a block diagram illustrating an example of the constitution of the hardware of 
the intrinsic representation extraction rule generating system and intrinsic representation 
extracting device shown in Figure 1. 

Figure 3 is a flow chart illustrating an example of the processing process of the intrinsic 
representation extraction rule generating method in the present invention. 

Figure 4 is a flow chart illustrating an example of the processing operation of the intrinsic 
representation extracting device shown in Figure 1. 

Brief description of the reference numbers 

Figure 3 

Key: a START 
b END 

301 Morphological analysis of document for training 

302 Attachment of word type and character type to obtained word row 

303 Extraction of correct answer intrinsic representation and the word row before and 
after it 

304 Application of empirical rule or the like on the correct answer word row to obtain 
extraction rule group 

305 Recording of candidates obtained by applying various extracting rules on the 
document for training 

306 Classification of candidates generated for each rule to O, A, X, etc., and 
assessment of the rule 

307 Deleting of the rule group with poor results 

308 Preparation of new rule group after correction of the rule group with poor results 
(* as needed, repeating from step (305)) 

Figure 4 

Key: a START 

b END 

401 Morphological analysis of the object document 

402 Attachment of word type and character type to the word list 

403 Application of each extracting rule on the obtained word list to obtain the 
candidate list 

404 Is the candidate list empty? 

405 Output of the candidate with the highest priority in the generated candidate list 

406 Deletion of the candidates that compete with the output candidate from the list 

[0017] JBSaSKffi - fiMS^Sft4a53tt. P« ^ s \i r ffl+ j *k'^)ffiO«tlitS^->TV^*5fc«)fc 

^gwif^^fHnrtS. 40 <.mw<?)immfflLT^hk%l6ZktP?Z 

loo is) m£&mit.Mmm-sim:>m s. 

ft^3^!UlT#^iL&*^J5rjQByxh2-C*t [0027] *HtCiSVvaj. UtefflftHifflffiai 1 2 

■C«l^^W-&. ^^^Sff^SaiajSBMSFSt ^r^a5j^ rH+^j, rrjaifcypej, r H** 

[0019] patfflfflBflaffl96i±. sH"tt^a!4^ ««nsio+-cH tmm (±0%m£*s»x\± w 

5^S^g2 3t=l2»$*tS. 50 S. 
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10 0 28] Z<r>*otzLX— oomWVtttfitLh 
0£1MfcfcJ:0. B**Ktf>t>*M3*m*i.S. 

[00 29]fc<fU znxoiz&ztzvzmaLx. 

[0 0 30 J * £<02o<Oi£lf§*>|Sfc:. «KB^-* 
MtLT. r-&V4\>>\*?x<r>&<<r)$SgiX'- • • j T* 

*fcLtamLTV>£tf>#2 0EIfc-«lfcr. 20 
[ 0 0 3 1 ] *Wtt i. iStfiaE^fSSBli!ttB«B(» 5 a 

mm^mno^tim-^h^x. &m 

[0 0 33)OM^Acoi3i&m&H6aSB^i34&ai:j: 

tr. &imim^®&kLx*.%\i)m&&9tiiti 

6. 

[0034] ^wat^aftajaBn&s^xxAT 
XTj*zm&hmmft&o\*xwmz . 

[ 0 0 3 5] *-f . BJS&Htf • «&R*¥mtt4S53fc: 
ffHEt*L INfliJMl- ( fc»r«fcff 1 l£i:*<W. 50 
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[oo36i«xtf. rmm&<r»m#.m- • j 

PI r S&j ligj&feH.. r<»5.i OJffiSl r+»j <iH 
[ 0 0 3 7 ] ifc, rjg^j «aa<og|^Tiil£StiT 

J:5^rr-^m6»^syxh^aj*-r6. toe 

S). «o, t>4>*«*. B&Plh • • • J 
[0 0 381-*, jE»yxh2l±. UttBftfcgKO* 

• j k\*omammiiz*t!6Lx : Tit>!mzii&iffi 

[0039] 

0 3 xshr site 

5 6 «MJ A45 
20 23 3J19B Hft 
3 0 3 2 Ribjft *S 

[0040] £0>]XMC&tYt. SttOfftt, ZVfX 

v>S. CKOidt. *0KOjE»UXh2fctiV»rtt, # 

[0 04l]«SI4jK»4»i. d<oid=5f]E)By^h2 
t . JSS^BUf • fpP!X*Sft*a3^as*t&4^J 

[004 2]anylag(3) <- <@(»»6. 21).t» 
rd(_. SSt^. v»rd(^. ^Ssl^. $ 

i&sp). >@(ffl»«). 

[0 04 3] i<oasi #^ I" 2 1 j jW 

nzmw ( rwordc. m&mn ) . 

^s^. rm&j<r)m$m.<om&tLx%it>ti& 
t^ow*(?>mxt>i>. 

[0044] iOid'Srfiffl <04fl£«. X*) 

+ lfS[(wo, co. po). • • •, (Wi, Ci, 
Pi), • - . (wh, c«, pn)] TTTSTV^if 
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sm& ( r@#«aj . r^a^j 

[004 5l»SC«. lo^)^o*^)mSi>. 

mmtvH&temm^&zttk-*? . mm 
->v) lift. *0rcuu 10 

[oo47] -r*^ P^axs i £%&tiz,mim 
mttm-mi*m$m [(w 0 . co . P o>, • • 

• . (Wi. Ci, Pi), • • • , (WM, CH, Pi)] fc. 

[(wo', co', po'). • ■ - , (wi', Ci' r pi'), 
■ • ■ , (wm* , cn ' , P «')] dV>£o%m 

[0 04 8] anytag(u) <— <@(t + d f . k). wo 
rd(wo\ co', po'). • - - . (wi', ci'. Pi'), 20 
• • , t«rd(w»\ ci'. P s'), >@(t-dt). 

[0049] ^cirrtjtt. w&m(rm& mm 
r«»fej ) zmt. r+df j »±. z<^mm<^m 

mm^nmrcht. sit, r-dt j im^mi 

[0050] Mitf. rmtfmv- - -jfcv^ittt 
i&j m&Ti>&tzi>irt>he>-r. M&mffi ■ smx 30 

cop*. @*&f©. (hj 
rt. sags*, um^i)] t=5ro. aswu** 

•r*>-r*:tf>fc. r d t=i j t-r&. wmmiH' 

£,c=S:V*DT\ l"df=Oj-CfcS. 

[ 0 0 5 1 ] a*. ±a<offlBi (fv-ivy {zmt 

[00 5 2]=&^aS-#trr-^(wi', ci'. P i') 

sr-^(wi. ci, P i)wtLT. ycrmsmfn. 

ttzX^X&h. 

[OOSSlorij^roj^rNj-c. K**S<D 
£#££tfl§£ < d f >0£fcttd t>0 ) UU d#l*> 
£^R<fcUrv>. SBI 0P-/1-) <D rd f j k rdtj 

s. 50 
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©SW&PiO^li r WiJ fc^&fcts. 

[0054] %m6&&£8ix&±(Mmzwat& 

[ 0 0 5 5 ] #«B!!<Offi5fee ( u ) k LTli. 0, 
<^Srv^(1?i£cOWC»i. Jfefei: UTO r*74 h 

)ifimM&<o2»mi(imt!.tLx<o r*v 
[00 56] ^ d Lrmm^jSM4 izx ^n^n^^m 

V^T. iWW8^Sl<O#S99tJiffl-rs^kK:J:0i«t6 

[0057] m«iffl^7ica. m«sttic(±. m&m 
w®vme8mwm®mtiMt:Lx. mmz&Ji 

<. 

[0058] zn*o%ffifflmmm6iz£h*m 
mti&fflttmmt5(?>£X(om£ttLxtt 

[oo59] #hJ*Tv7m<rymjmim\>*h. 

[00 6 0] «H0»«S8tt, ClOidfcLtf^S^ 

tipmmmnr^thLx. ^wn^miu^-r 
i. t Lxim«%mmm^izktfx'£ 

nmx-»>z. zmk&fiEmmt. 

\t. m*oz^km*&bitxm»h»t:mrt-z 

fc»?£#<5»ivfir^. -td-C. #«BJ (R) (vmm 
Toioizti&LX^tZ. 
[0061] 

(O) «H!R^)7c{c^o/i®fi-^akv-yf-LT^^it 

( a ) w&^hM<m%mitfim y x h 2 t:ss$*t 
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mm&m) 



fAj , fxj <0@»*at« rofxj flJEBS^^jE -1*). <@(t + df . k), word(wo'. co". po'). 

IKOiga. roj O0fc$riE»^tsja<7>ftffli: LT^ffl • • • . (wi\ ci'. Pi '). • • • , wcrf(w»\ 

•ft. Sttfc r Aj ^T^3E»k#iifc. rffl d'. pn'). >@(t-dt), (w-i'+wi). 

+jo«t5fcfiv%gre^^aasi-&fflBW { ^Hfc** io [006 9 j zoLxmtvtdmti* jBomutQM 

[0063] #K0lfttS*8«. PH08S&7 Lfc<-Ct>. PBEfflS»7*c«oTV^7C^«B5«0-7-/ 

^AJftgOBSDOTTfeO . %<T>9A TTfcfiB [0070] £»J: 3 fc#0TCtt. fflBltf>&£#. ffi?> 

fee. fiSLts^£GsiciE»(os^^s*^L^ (®s^aiaispjs5 > &t>. x *)fm<r)&\^®m 

£xtm<rttf8&m£i5»xm.Tt>ttfi. mm 20 [ 0 0 7 1 i H3«. #^fcBij&s**syi!iHiaBJ 

wsttts-oHsr. [00721 mi^fim^^skamm. 

[oo64] «bhw»8#. «aia<o ro j . A^T-u^mm^am • Ji^**a#4a53 . 

x. mfm&9kmsmmi oimtmmxm MMttv-fttycsbo. **\ mmm-smx 

[00 6 5] «lfm9»±. @**S»iiS8B9S5«) JVWcjMIL Uf7730 1 > . #«gfcfi^t*^ 

auort. «ttr. rxj (of®^ roj <nmim. mzzcotimzmatz u-f ^7-302) . 

sai«sB9^-&. mmmi out. BtnssBiaas [0073] xt£. «i&**4e*vvc. iBB'jxh 

fl9S5^aiiort. «i« . jsa** r x j 30 2cm$mmt . *<oifi6Ffc*s#g*^fcs wit 

roj <om<v¥#o}j>bm^ mkmg&zic haul? (xfy73 0 3). mmmiugmm 

mimmmtaix. zmmwm&xQ&K* *wbl/c, »aj«8!*£jSL <xr -/t-3 04 > „ §a 

[oo66]0i*tf. m$m<vmk2semrot:it*> [0074] -etr. PtsfB«B!i»8«6fcfcvvc c 

X%ihk. ±mimx~fo&Zti. roj * rxj cff ^J:3£LT^Lfcttai8Bi]£. SttKftfcSlfcJBB 

[(w-i, c-i. p-»), (w-i, c-i, p-i), (wo, c (Xtv7305) . 

0, po), - • • , (wk.i. e B *i. ph.i). (w»»2. c [0075] $&fc. Ml^flB|g8fci5V^-C. 

p»*i). 3 k^dWUAVjW^rlstiMi fflSI-ertttJ ?iut®t^S^I0Sg (CX a. x) 

S. *ZX. #a*«BSfc(w- 2 . c-:, p-2, 40 tfVCJHSU ^WcS^t. #ttStS«B!I<^jEJS?:^i 

W-l, C-l. P-l, WH+1, CBU, PM»1. W»*2, c (Xt*/7"3 06) . 

«♦». pm)fcV>5fc»*>yxh£#£. r Oj fc«a [0076] ^<^[<0fe*, JgiFRrfi&SrJSil^lV^ 

mn*tetmt%ituf % zti\mm&cmt&g 1 (xf » r3 0 7 ) , tyi, ikec i oaiEg*^* 

<^8S-ci5 9 . aflwjsa^g^^-eco^ tfijirc s«RiffK:<i. mBDsaa 1 0 izis^x^wamaat 

#S. t. mmtL (Xf77308) . i3^a®^««4 

[0 0 6 7] fli.fcf. «£*fc«tS^S*fflV^C:kfc aj«BJ»5a»cffi»tS. Xr-yT30 5*»fe<Ojl^i& 

ix. r x j (c^isnTta^r^s* 5 r 1 o j ffl»aj$ » [ o o 7 7 ] H4 «. 0 1 izmiiwFS^mta^^ 



mum) . 

[0062] 2BHiW&8tiL ««Hlfc*tLT j . 



[0 068] anytag(u) <— Nard(w-i', c-i*. p 
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^rnm^^fyo-^-^—hvhh. *wj±. hi 

A^8ft^3tisv^T. frastsi i zmmmmftL 
mtx-mzt'cr)imi:imtz> uf7/402). 

[0078] Uct. lafcfflaSMHBSl 2fcfevvc. £ 

t Ut?T4 0 3). ^TWR»fcWL-COT«^»t 
Vmmiffo (*r?r4 04) . r£*>^ #«M 

06) . 

[00791 0Ju Hl^B4£fflV)T8HRL^J:d 

[ o o 8 o ] fcfc. dii^wttasaai* -WFfia&fc 

PHHttSlfcWai/C. *<0^#ftt»8*Sl*>if 
[0081] f Lt, ^S<o;P-;W^a*^H!rS«^ 

mv^TEMttiX^ZtPtfbfrh. *ZT. JEM0M 

mxmmtat&zttxix. m&mxmz&vz 

[008 21 znxoiz. *0!fci*ur. mnmiz^; 
tsmmxmt. toast fc*<oid^ 

W&(m5$^is£tiX^&ir>Z?mLlZJEmV x h 

ttiht. isXT&vznjfflt&^xmtiSizm 
[0083] $^>fc. "tibmistvizmtim&i&tt 

lX£1iZti&®*(nmnmt:'!&>. ifcfc. «£W> 

mmmmfrhfsmzmx'Zi. ztuzx->x. j*v> 
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twt&mmfflzmmtz z. b tfTZ h . z.<n 
Lxm*. «es:i»)±$^ - b#m&t 
[ o o 8 4 1 & *womtimmii%mT\^ mm 

mX£tJEMVxhiz£rl^X>lLfS.ZtL. to^ 

■tiaasu s^fc. mumix'hhifim.vmbhm 

[0085] N, Hl~H4S:ffl^TSH8L 

mvicm (mz. - • • ) {%m. 

[0086] Sfcfc. *0TCtt» *fa?2 5*IBS« 
flcfc l/08vvcn£#. FD^SM3i*i:L-CfflV^i 
i:Tt>&v%. Ta-fjJxnAyxY-frKmJX 
a^S2 7^LT*->'hV-^gi-Cro^ 
A^r^vn- H LX A vx Y-)V?h Z. k Xi>&\\ 
) [0087] 

o ©ta^aiaBycoes^. m<vmbimm.L 

mi] ^^m^hwmmsiia^mm^xT^ii 
[S2] ®i^vhm$^fflm£&'>x7-j*t} 



(10) 



17 



[@3 ] *m£m>hmmm&®M&m>mcm 
i -.mmm. 2: jehuxk 3:mmm-& 
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msm. 7 : mgrafesi. s : mrauHBaL 9 : mm 
mm. i o : wmmt. 1 1 : %mm. 12 -.m 
mmmm&. 13 imm^tif^mm^jxh. 

2 1 : msm. 22 : 23 : ttSBlB«Si 

24 : fi!«S(!U!5!g, 24a: CPU, 24b :£ 
25:*fa?, 26:l^&gS. 27:31 
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